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What is claimed is: 

1 . A method of detecting script language viruses in data streams comprising: 
preparing language description data corresponding to at least one script language; 
preparing detection data for viral code corresponding to the script language virus; 

5 and 

lexically analyzing a data stream using the language description data and the 
detection data to detect the viral code. 

2. The method of claim 1, wherein the language description data correspond to 
10 Dynamic Finite Automata data. 

~~. 3. The method of claim 2, wherein the Dynamic Finite Automata data comprises 

a set of states, with each state having a corresponding set of transitions and each 
transition having an associated character to be matched and an associated next state. 

15 

4. The method of claim 1, wherein the language description data correspond to 
language definition rules and language check rules. 

5. The method of claim 4, wherein the lexical analysis includes one or more 
20 pattern matches based on the language definition rules. 

6. The method of claim 4, wherein a script language used by the data stream is 
determined by the lexical analysis using the language check rules. 

25 7. The method of claim 1 further comprising setting language definition rules for 

each of the at least one script language. 

8. The method of claim 1, wherein the detection data comprise at least one test, 
wherein each of the at least one test correspond to a pattern match or a cyclical 
30 redundancy check. 
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9. The method of claim 1, wherein the step of preparing detection data 
comprises: 

obtaining samples of the viral code; 
analyzing the obtained samples; and 
5 setting a detection regimen that includes at least one pattern match or cyclical 

redundancy check based on the analysis of the obtained samples. 

10. The method of claim 1, wherein the data stream is converted to a stream of 
tokens using lexical analysis. 

10 

11. The method of claim 10, wherein the tokens correspond to respective 
language constructs. 

12. The method of claim 10, wherein a cyclical redundancy check is performed 
15 on the stream of tokens to detect viral code. 

13. A program storage device readable by a machine, tangibly embodying a 
program of instructions executable by the machine to perform method steps for detecting 
script language viruses, the method steps comprising: 

20 preparing language description data corresponding to at least one script language; 

preparing detection data for viral code corresponding to the script language virus; 

and 

lexically analyzing a data stream using the language description data and the 
detection data to detect the viral code. 

25 

14. A computer system, comprising: 
a processor; and 

a program storage device readable by the computer system, tangibly embodying 
a program of instructions executable by the processor to perform method steps for 
30 detecting script language viruses, the method steps comprising: 
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preparing language description data corresponding to at least one script 

language; 

preparing detection data for viral code corresponding to the script 
language virus; and 

5 lexically analyzing a data stream using the language description data and 

the detection data to detect the viral code. 

15. A computer data signal embodied in a transmission medium which embodies 
instructions executable by a computer for detecting a script language virus, comprising: 

10 a first segment including script language processor code to prepare language 

description data corresponding to at least one script language; 

a second segment including detection data processor code to prepare detection 
data for viral code corresponding to the script language virus; and 

a third segment including detection engine code to lexically analyze a data stream 
1 5 using the language description data and the detection data to detect viral code. 

16. An apparatus for detecting script language viruses, comprising: 

a script language processor, wherein the script language processor prepares 
language description data corresponding to at least one script language; 
20 a detection data processor, wherein the detection data processor prepares detection 

data for viral code corresponding to a script language virus; and 

a detection engine, wherein the detection engine lexically analyzes a data stream 
using the language description data and the detection data to detect the viral code. 

25 1 7. The apparatus of claim 16, wherein the language description data correspond 

to Dynamic Finite Automata data. 

18. The apparatus of claim 17, wherein the Dynamic Finite Automata data 
comprises at least one set of states, with each state having a corresponding set of 
30 transitions and each transition having an associated character to be matched and an 
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associated next state. 

19. The apparatus of claim 16, wherein the language description data correspond 
to language definition rules and language check rules. 

5 

20. The apparatus of claim 19, wherein the lexical analysis by the detection 
engine includes one or more pattern matches based on the language definition rules. 

21. The apparatus of claim 19, wherein a script language used by the data stream 
10 is determined by the lexical analysis of the detection engine using the language check 

rules. 

22. The apparatus of claim 16, wherein the detection data comprise at least one 
test, and each of the at least one test correspond to a pattern match or a cyclical 

15 redundancy check. 

23. The apparatus of claim 16, wherein detection engine converts the data stream 
to a stream of tokens using lexical analysis, and the tokens correspond to respective 
language constructs. 
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